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The present invention relates to a. memory device having a shift register 
fintetionajity, suchas.HFO JtofcfcW* memory, which can be t^ for coupling 
one or more data sources to a plurality of data processing devices. 

Densities and transfer rates in modem storage systems are steadily increasing 
One of the consequences of mis are increased amplitude and phase distortions hampering me 
read-out stgnal. To sustain an acceptable error rate under these cimumstances, advanced 
srgnal processing techniques are required Partial-Response Maximum-Ukelfoood (PRML) 
receivers proved to be exttemejy efficient for a very wide scale of applications ranging ftom 
(magneto-) optical disc systems and ham drives to satellite and mobile communication 
systems. PRML-based detection is used in almost all high-end storage systems nowadays 
The use of PRML-based receivers allows to significarttly increase me system capacity and/or 
margms or, equivalent^, decrease coste while keeping me capacity and/or margins 
unchanged, m particular, inttoducmg the PRML-based detection in advanced optical disc 
^sterns, atch as DVD+RW (Read and Writeable Digital Versatile Disc) and DVR (Blue-ray 
Drsc), ts shown to decrease system sensitivity to noise, defbcus end disc titt. The PRML- 
based detection allows to relax me requirement on me mechanical and option! system 
components and, as a resutt, to achieve higher system robustoess and lower production costs 
The so-called Viterbi algorithm is considered to be the most efficient practical 
nnplementation of me PRML detector. However, the speed of the Vtterbi algorithm Is 
urttmsically limited because of fhe data-dependent feedback loop i, contains. An Add- 
Compare-Selec, (ACS) operation has to be executed every clock cycle, and me next cycle 
cannot be started before the result of the ACS operation is available. 

Apart ftom increasing the raw clock ftequency, an alternative approachhas 
been suggested to impmve foe ftroughpu. of a Viterbi detector. Tbis alternative approach is 
based on foe idea of splitting an incoming data stream into sevend sub^treams and 
processing foem in pandlel. However, a parallel implementation of foe Viterbi decoder for 

*^a'^ Speed ftom a digital storage or transmission system, such as an optical disc 
' 1 -'-«l»..J-^^ B i,. rf ^ -1 , IlillBM11B| ^ ■ 
clock mte digital hardware. Such Mgh clock rate digftal hardware results for example from 
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the requirement of having the different parallel data streams buffered in FIFO memory 
blocks. Attempts to combine the different FIFOs in one embedded block lead to a demand for 
a multi-port memory, if the sum of the sample rates of the FIFOs is higher than the system 
clock. This is the case in many signal or image processing systems, storage systems as well 
as co mmunic ation systems. 

In terms of area and power consumption as well as silicon implementation 
aspects, one unified memory block of capacity ZxBis usually cheaper than Z memory 
blocks of capacity B, so that it would be preferable to use one memory of a bigger capacity 
than many small ones. Additionally, single-port memories of the same capacity are cheaper 
than their multi-port counterparts. Consequently, it is preferable to use single-port memories 
instead of multi-port ones. 

It is therefore an object of the present invention to provide a memory 
architecture which enables replacement of several small FIFO memory blocks by one unified 
embedded memory block. 

This object is achieved by a memory device as claimed in claim 1. 
Accordingly, the proposed architecture in which shift register accesses to 
predetermine register memory blocks are mapped to subsequent addresses in the global 
address space of the memory, allows to combine a plurality of FIFOs or other kinds of shift 
register memories in one embedded memory, such as a RAM (Random Access Memory). 
This implementation is most favourable in view of power consumption and area. 
Furthermore, the favourable implementation allows to replace a multi-port RAM by a single- 
port RAM of the same capacity. Such an implementation is not only even more favourable in 
terms of area and power, but also has a reduced cycle and access time. 

The access control means may comprise at least one address counter or any 
kind of finite state machine which generates the required address sequence. Thereby, a 
simple implementation for address generation can be provided. 

Furthermore, the address ranges of the predetermined register memory 
sections may comprise overlapping regions of a predetermined size. This provides the 
advantage that the capacity of the embedded memory is less than the total number of memory 
I locations of all replaced FIFO blocks. These overlapping regions can be advantageously used 
as teaming regions for Viterbi detectors, where an mcoming data stream is splitted between 
several detectors in such a way that the sub-streams overlap. In this case, one detector has the 
overlapping region at the end of its sub-stream and another detector has the same region at 
the beginning of its sub-stream. The latter detector may use the overlap region for training 
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3 23.06.2003 
purposes since the bit decisions can be supplied by the former one. The proposed scheme 
needs no special markers in the data stream, and can be used in applications where the data 
stream is already standardised and there is no possibility to include markers in it 

The at least one access port may provide access to a plurality of data sources 
for writing data to respective ones of the register memory blocks, and to a plurality of data 
processing devices for reading data from the register memory blocks. In particular, the access 
control means may be arranged to provide alternate access for the data sources and the data 
processing devices. The data source accesses may be controlled to cycle through the global 
address space, and the processing device accesses may be controlled to cycle through the 
address range of the respective register memory section. This cyclic read and write operations 
facilitate address translation required for the read-out accesses. 

As an advantageous modification, a buffer memory may be connectable to the 
at least one access port and to the memory, wherein a line width of me buffer memory and 
the memory is selected to be greater or equal the data width of the at least one access port 
multiplied by the sum of read accesses and write accesses per cycle. Thereby, a single-port 
architecture can be provided, where the embedded memory is replaced by a memory block of 
less memory lines and correspondingly more bits per line. The data source or data sources 
provided) the write data to the embedded memory via the buffer memory. In this way, an 
entire line can be written to the embedded memory at each write cycle to thereby reduce the 
access frequency. Multi-port RAM devices can thus be replaced by single-port RAM devices 
of the same capacity, while providing a reduced cycle and access time. Moreover, the number 
of write ports of the at least one access port may now differ from the number of read ports. 
Hence, the number of data symbols accepted by the memory device may differ from the 
number of data symbols produced in every cycle. This provides the advantage that the 
number of processing devices connected to the embedded FIFO memories may vary and does 
not have to be equal to the number of input data streams. 

The access control means may comprise address translation means for aligning 
addresses relating to the read accesses in such a way that they fit to the line width. In 
particular, the access control means may be adapted to transfer write accesses to the buffer 
memory until it is full, and to write one memory line when the buffer memory is full. On the 
other hand, the access control means may be adapted to align read accesses in such a way that 
a block of the line width is read all the time. The address translation means may comprise a 
look-up table or a finite state machine which generates the required output sequence 
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Advantageous modifications of the present invention are defined in the 
dependent claims. 

The present invention will now be described on the basis of preferred 
embodiments with reference to the accompanying drawings, in which: 

Fig. 1 shows schematic functional block diagrams indicating a transfer from 
individual FIFO memory devices to a memory device according to the first preferred 
embodiment with embedded FIFO blocks; 

Fig. 2 shows functional block diagrams indicating a transfer from an 
embedded multi-port memory structure to a single-port memory device according to the 
second preferred embodiment with buffered access; 

Fig. 3 shows a schematic block diagram of an address generation functionality 
used in the first and second preferred embodiments; 

Fig. 4 shows a schematic block diagram of an address translation functionality 
used in the second preferred embodiment; 

Fig. 5 shows a table indicating a memory access schedule according a first 
implementation example; and 

Fig. 6 shows a memory access schedule according a second implementation 

example. 

The preferred embodiments will now be described on the basis of a memory 
architecture where a number Z of FIFO memory blocks are implemented in one unified 
embedded memory block EM. 

Fig. 1 shows a schematic functional block diagram indicating in its left portion 
conventional discrete FIFO memory blocks FIFOl to FBFOZ with dedicated input terminals 
and output terminals. This conventional structure can be transferred to a memory architecture 
according to the first preferred embodiment of the present invention shown in the right 
portion of Fig. 1 . According to the first preferred embodiment, the small FIFO blocks FIFOl 
to FEFOZ are replaced by one unified embedded memory block EM. Thus, if the Z FIFO 
blocks each have a memory capacity B, they can be combined into the embedded memory 
block EM having a capacity of ZxB, wherein the FIFO accesses can be mapped to subsequent 
addresses in the global address space of the embedded memory block EM. It is to be noted 



PHNL030780EPP 



5 23.06.2003 
here that the capacities of the FIFO blocks not necessarily have to be the same. Each FIFO 
block may have an individual specific capacity, which has to be considered in the 
implementation of the individual addressing schemes. The read and/or write accesses can be 
controlled by an access control unit A which supplies corresponding control signals and 
addresses to the embedded memory block EM. Furthermore, data units to be written to or 
read from the embedded memory block EM can be supplied via respective access ports PI to 
PZ. 

Each FIFO block has assigned or allocated a redetermined FIFO memory 
section with corresponding address ranges AS1 to ASz. The address generation functionality 
of the access control unit A may be based on sequential or cyclic counters or sequential or 
cyclic counting frmctions to thereby generate subsequent addresses for reading from and 
writing to the embedded memory block in a FIFO manner, i.e. the first data unit which is 
written to the corresponding FIFO memory section after a corresponding shift delay which 
corresponds to the width of the FIFO memory section. In particular, a key feature of the FIFO 
or other shift register memories is that data units or data words are read and written 
sequentially. 

The system designer may thus replace the conventional small FIFO blocks 
FIFOl to FIFOZ by the unified embedded memory block EM, for example, when designing 
multiplexing or demultiplexing devices which may be used in Viterbi detectors and which 
may comprise an interleaver or de-interleaver functionality, respectively. In these cases, an 
incoming data stream is de-multiplexed and interleaved to allow parallel processing of the 
data by a set of "out-of-the-shelf • Viterbi detectors. The output data streams coming from me 
parallel Viterbi detectors are multiplexed and de-interleaved to form the cutout bit decision 
stream with a format similar to the format of the input stream. A Viterbi^based bit detector 
organised in such a way is transparent to the rest of the system and can be easily integrated 
into the existing data flow within an integrated circuit. Each of Ihe sequential Viterbi 
decoders may be operated at a speed lower than the bit rate. The slower the speed of the 
Viterbi detectors is, the more Viterbi detectors are needed to run in parallel. The number of 
Viterbi detectors grows at least almost linearly with the ratio between the bit rate and the 
speed of the detectors. 

For such detector applications or even for other kinds of appUcations, it can be 
useful to create overlapping regions between the different FIFO memory sections of the 
embedded memory block EM. Then, at least one additional Viterbi detector is needed i 
order to handle the overhead related to the presence of the overlapped regions in the 
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subdivision of the input stream, wherein the same input samples are processed at least twice 
in the regions of overlap. The size of the overlap region, which is required to keep the error 
rate of the proposed parallel Viterbi detector unchanged with respect to the standard 
sequential Viterbi detector, is small. It could be in the order of 50 to 100 input samples for 
the concerned application, e.g., optical disc systems, hard disc drives, digital tape storage 
systems, or the like. A part of the overlap region, typically 30 to 50 samples, can be used to 
initialise a backtracking array while the remaining part of it, typically 20 to 50 samples, can 
be used to initialise a path metrics. The provision of such overlapping regions leads to the 
advantage that the capacity of the embedded memory block EM is less than Z x B. 

Fig. 2 shows a schematic functional block diagram of a second preferred 
embodiment in which the multi-port embedded memory block EM is replaced by a single- 
port embedded memory block EM with an additional buffer memory B. 

In this connection, it is assumed that a multi-port memory block EM of M 
lines with N bits per line, as shown in the left portion of Fig. 2 has to handle X write accesses 
and Y read accesses in every cycle. Therefore, the access ports PI to PZ comprise X write 
ports and Y read ports. This implies that two or more write ports may be assigned to one 
FIFO memory section or that one read port may be assigned to more than one FIFO memory 
section. 

If the line width L of the buffer memory B satisfies the condition L £ X + Y, 
then the embedded memory block EM can be replaced by the single-port memory block 
which then comprises M/L lines with LxN bits per line, if read and write accesses are 
buffered. With the enhanced buffer size of LxN bits on the write side, the accesses can be 
transferred to the buffer memory B instead of the embedded memory block EM until the 
buffer memory B has been filled. Once the buffer memory B is full, one memory line of the 
embedded memory block EM can be written. 

In case, the timing requirements of the buffer memory B do not allow writing 
data into the buffer memory B and copying the buffer content to the embedded memory 
block EM within the same cycle, the width of the buffer memory B can be defined as one 
sample width smaller than the width of the embedded memory block EM, i.e. a buffer width 
of Nx(L-l), so that the last access (which would otherwise have filled the buffer memory B) 
is directly transferred to the embedded memory block EM in parallel with the buffer contents 
without being buffered. However, the line width between the buffer memory B and the 
embedded memory block EM remains LxN. 
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At the read side, the accesses have to be aligned in such a way that a block of 
LxN bit ^ wad all the time. In view of this, the access control unit A is adapted to generate 
addresses in such a manner that the addresses of the accesses are aligned to fit the line size of 
the embedded memory block EM. This can be achieved by introducing an address translation 
unit AT which may be a separate unit or which may be incorporated into the address 
generation functionality of the access control device A. The address translation unit AT may 
be provided or controlled on bom read and write sides. 

This buffering scheme introduces a latency of L cycles added to the access 
time, i.e. data written in a cycle K becomes available only in the cycle K+L+l. For most 
applications, however, the parameter L is rather small. In addition, many applications which 
operate on a stream of data samples are not very sensitive to latency anyway. 

Hence, in the second preferred embodiment, the input data stream or streams 
which result from one ore more data sources provides write data to the embedded memory 
block EM via the buffer memory B having a length corresponding to the enhanced line width 
of the embedded memory block EM. In this way, an entire line which corresponds to several 
memory accesses, can be written to the embedded memory block EM at each write cycle. 
Thereby, the access frequency can be reduced as compared to the multi-port embedded 
memory block of the first preferred embodiment 

Fig. 3 shows a schematic block diagram of the address generation 
functionality in the access control unit A in the first and second preferred embodiments. The 
address generation may be based on simple counters or counter functions CI to CZ for 
generating respective addresses Al to AZ to be used for addressing the FIFO memory 
sections allocated to the access ports PI to PZ. Each counter function is controlled by a clock 
signal Clk and by a reset signal Res for resetting the counter function to a predetermined 
initial address value. In case of an addressing of FIFO memories, as in the first and second 
preferred embodiments, the counter functions CI to CZ may simply be implemented as 
sequential counter functions, e.g., a first counter for generating a pointer to the respective one 
of the address ranges AS1 to Asz and a second counter for generating the respective address 
of the memory line within the selected actual FIFO address range. 

In the second preferred embodiment, an additional counter or counter function 
CB is provided for generating a buffer address AB to be used for addressing the buffer 
memory B. 
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It is noted that any finite state machine (FSM) which is adapted to generate the 
required address sequence can be used instead of the counter functions CI to CZ, CB shown 
in Fig. 3. 

Fig. 4 shows a schematic block diagram of the address translation 
functionality in the access control unit A according to the second preferred embodiment. This 
address translation functionality may simply be implemented based on a look-up table LUT 
in which the required address sequences are stored The look-up table LUT is addressed by 
the addresses Al to AZ generated by the address generation functionality of the access 
control unit A. For each of the addresses Al to AZ a corresponding memory address ABM of 
the embedded memory block EM is stored in the look-up table and supplied to the embedded 
memory block EM. 

However, also the address translation functionality may as well be 
implemented as an FSM which produces the required address sequence. In the case of FIFO 
memory sections, this FSM may again consist of two simple cyclic counters. 

It is noted that the above first and second embodiments can be implemented in 
a fully independent manner and do not require any increase in the clock frequency. 

In the following, a first example of an implementation of the preferred 
embodiments is described with reference to Figs. 5 and 6. The implementation may be used 
in a multiplexing or demultiplexing functionality of a parallel PRML bit detector such as a 
Viterbi detector. In this system, five data streams are processed in parallel, so that five FIFO 
memory sections are required in the embedded memory block EM. At the input side, for 
samples of eight bits each must be read and distributed between the FIFO memory sections 
every clock cycle. In the case of the first preferred embodiment, a multi-port memory 
architecture is obtained with at least one access port, i.e. one read port and one write port 
The access port has to be able to accept four samples and to produce five samples in every 
cycle. 

The required memory capacity depends on the specific application. It is 
assumed that each FIFO memory section has a memory capacity of 300 bytes. In the case of 
non-overlapping FIFO memory sections, a total capacity of the embedded memory block EM 
would be 1500 bytes. In case of a 20 % overlap of the data read, which means that some of 
the input samples are read by two different FIFOs, the address ranges AS1 to ASz in Fig. 1 
overlap, such that a total capacity of 1200 bytes is enough. 

In the case of the second preferred embodiment, the line width of the buffer 
memory B may be 12 bytes, such that 12 memory accesses can be buffered. The embedded 
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memory block EM can be organised as a SRAM (Static RAM) with hundred lines of 96 bits 
each, assuming that one byte consists of 8 bit. Thus, three cycles are required to buffer the 12 
samples and write them to the embedded memory block EM. Consequently, a write access 
occurs once in three cycles. With one read access, 12 samples are read, wherein five read 
accesses occur due to the required processing of five data streams at the output side of the 
embedded FIFO blocks. Thus, the five read accesses occur during every 12 cycles. Due to the 
tact that the lengthening factor has a value of L = 12 in the present case and the number of 
write accesses has a value of X = 4 and the number of read accesses has a value of Y = 5 the 
above requirement L £ X+Y is satisfied in the present first example. 

Fig. 5 shows a table indicating one possible implementation of a suitable 
memory access schedule for the first example. In the first and second cycles (cyles No. 0 and 
1) no access is performed to the embedded memory block EM, and in each cycle four 
samples are written into the buffer memory B. In the third cycle (cycle No. 2) four samples 
are again written into the buffer memory B which is now full so that a write access to the 
embedded memory block EM can be performed by copying the content of the buffer memory 
B mto the embedded memory block EM. In foe fourth cycle (cycle No. 3) a read access can 
be performed to the first processing unit at the output side of foe embedded FIFO blocks eg 
a first PRML decoder, so as to transfer 12 samples to foe first PRML decoder, while four' 
samples are again written into foe buffer memory B. Similarly, in foe fifth cycle (cycle No 
4), a second read access is performed and 12 samples are transferred to a second PRML 
decoder, while four samples are written into foe buffer memory B. In foe sixth cycle (cycle 
No. 5), four samples are written into foe buffer memory B which is now again full, so that its 
content is copied into foe embedded memory block EM in a new write access. Then, in foe 
seventh and eight cycles (cycle No. 6 and cycle No. 7), respective read accesses are 
performed in order to transfer 12 samples to a third PRML decoder and to a fourth PRML 
decoder, while four new samples are written into foe buffer memory B in each of foe two 
cycles. Then, in foe ninth cycle (cycle No. 8), four samples are written into foe buffer 
memory B which is now full again, and its content is copied into foe embedded memory 
block EM during another write access. In foe tenth cycle (cycle No. 9), a read access is 
performed again, wherein 12 samples are transferred to a final fifth PRML decoder while 
four new samples are written into foe buffer memory B. In foe following eleventh cycle 
(cycle No. 10), no access to foe embedded memory block EM is performed at all, while four 
samples are written into foe buffer memory B. Finally, in foe twelfth cycle (cycle No 1 1) 
four additional samples are written into foe buffer memory B which is then foil again, and a 
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third write access is performed to copie the content of the buffer memory B into the 
embedded memory block EM. 

Accordingly, every twelve cycles there are four write accesses and five read 
accesses. However, this does not mean that the memory lines of the read accesses are located 
more and more ahead of the memory lines of the write accesses. The reason for this is that, 
from time to time, the same data are written by two different ones of the PRML decoders. 

In general, the line width of the buffer memory B and the embedded memory 
block EM not necessarily correspond to the sum of the number of write accesses and read 
accesses. Actually, the parameter L has to be at least this sum. However, a higher value can 
be chosen for the parameter L e.g. if this value better matches to a suitable memory size of 
the embedded memory block EM, for example 1200 bytes. 

In feet, in the above first example, written data becomes available only after 
13 cycles. It has to be noted that in cycle No. 3 of Fig. 5, the read data does not correspond to 
the data written one cycle earlier. There is always a delay of at least 13 cycles. In addition, 
after power-on, some time is needed to write data into 1he embedded memory block EM 
before "useful" data can be read. 

In a practical implementation, the resulting embedded memory block EM of 
the second preferred embodiment, excluding any buffers and address generators, has an area 
of 0.15 mm 2 and consumes 0.951 mW/Mhz in CMOS18 process. A comparable dual-port 
memory which suits the requirements of the application, i.e. 240 lines with 40 bits/line, 
requires an area of 0.46 mm 2 and consumes 0.21 1 mW/Mhz per access port. In addition, the 
usage of the single-port embedded memory block EM allows an IC implementation with less 
advanced and thus cheaper CMOS process. An implementation of five FIFOs would result in 
an area of 0.44 mm 2 and a power consumption of 0.944 mW/Mhz. The advantages of the 
proposed technique are thus clearly visible. 

Fig. 6 shows another suitable memory access schedule for a second example 
of the preferred embodiments. In this second example, three output processing devices Ol to 
03 are connected to the embedded memory block EM which in the present second example 
comprises a total memory capacity of nine lines L0 to L8. Consequently, each FIFO section 
consists of three lines. In particular, the first output processing device Ol uses lines L0 to L2, 
the second output processing device 02 uses lines L3 to L5, and the third output processing 
device 03 uses lines L6 to L8. Every even cycle, a write access takes place, and every odd 
cycle, a read access. The write address generation ftmctionality of the address translator AT 
generates an address A = C/2 mod 9, wherein C denotes the cycle number and "mod" 
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denotes the mathematical modulo operation. Furthermore, the read address generator 
functionality of the address translator AT generates a read address A - 3 x (n mod 3) + INT 
(n/3 mod 9), wherein n = (C-l)/2. 

Hence, the write address cycles through the memory in another order than the 
read address. Thus, the output samples are not necessarily in the same order as the input 
samples. According to Fig. 6, a reading scheme "read L0" — "read LI" - "read L2" is used 
for the first output processing device 01. Furthermore, a reading scheme "read L4" -> "read 
L5» — "read L3" is used for the second output processing device 02, while the initially read 
lines L4 and L5 are discarded after imtialisation. For the third output processing device 03, 
the reading scheme is "read L7» - "read L8" - "read L6", while the initially read lines L7 
and L8 are discarded. The discarding functionality may be implemented by inserting some 
dummy data which does not represent any meaningful data in the beginning of the stream. 
This discarding functionality is only relevant at the start after reset, when the embedded 
memory block EM is empty. The initially discarded lines are indicated in Fig. 6 by 
correspondingbrackete. Thus, in the access scheme of Fig. 6, the second and third output 
processing devices 02 and OS receive their first "useful" data after a short delay, whereas the 
first output processing device Ol receives it immediately. 

As soon as every address was written at least once, the system has passed the 
imtialisation and has reached a stable state, so that no discarding is required anymore. Also in 
the Viterbi detector example, this problem can be solved by introducing some "senseless" or 
dummy data in the beginning for imtialisation purposes as already mentioned above. 

As can be gathered from Fig. 6, lines L0 to L8 are successively written in 
every even cycle starting with the first cycle (cycle No. 0). In the second cycle (cycle No. 1), 
line L0 is read and supplied to the first output processing device Ol . In the fourth cycle 
(cycle No. 3), line L4 is read and supplied to the second output processing device 02, while 
the content is discarded during the initial first reading operation. In the sixth cycle (cycle No. 
5), line L7 is read and supplied to the third output processing device 03, while the content is 
again discarded during the initial first reading operation for this third output processing 
device 03. In the eighth cycle (cycle No. 7), line LI is read and supplied to the first output 
processing device Ol. Furthermore, in the tenth cycle (cycle No. 9) line L5 is read and 
supplied to the second output processing device 02. Also this content is discarded during the 
first reading operation. The same applies to the reading of line L8 in the twelfth cycle (cycle 
No. 1 1), which is supplied to the third output processing device 03. Then, during the 
fourteenth cycle (cycle No. 1 3), line L2 is read and its content is supplied to the first output 
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processing device Ol. In the sixteenth and eighteenth cycles (cycles No. 15 and 17), 
respective lines L3 and L6 are read and supplied to the respective second and third output 
processing devices 02 and 03. Then, the procedure starts again with the first cycle (cycle 
No. 0). 

It is noted, that the present invention is not restricted to the above access 
schemes shown in Figs. 5 and 6. Any suitable addressing scheme can be used in dependence 
on a desired access scheme. Moreover, any number of read accesses, write accesses and line 
numbers can be implemented, while the above width requirement of the buffer memory B has 
to be considered in the second preferred embodiment. 

The present invention can be applied to any parallel shift register structure in 
optical disc systems, such as Portable Blue (PB) or Small Form Factor Optical (SFFO), 
DVD, DVD+RW, DVR, or any future optical disc system. Moreover, the present invention 
can be applied to magneto-optical systems, hard disc systems, digital tape storage systems, 
satellite and mobile communication systems, image processing systems and the like. In case 
of PRML or Viterbi processing systems, the present invention can be applied in the 
demultiplexing or interleaving functionality at the input side or as well in the multiplexing or 
de-interleaving functionality at the output side. The preferred embodiments may thus vary 
within the scope of the attached claims. 
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1 • A memory device comprising: 

a. a memory (EM) having at least two predetermined register memory sections 
addressable by respective address ranges (AS1 to ASz); 

b. at least one access port (PI to PZ ) for providing access to said memory (EM); 
and 



c. 



access control means (A) for addressing said memory (EM) so as to operate 
said register memory sections as shift registers and to map shift register accesses of said at 
least one access port (PI to PZ) to predetermined addresses in the global address space of 
said memory (EM). 

2. A device according to claim 1, wherein said access control means (A) 

comprises at least one address counter. 

3- A device according to claim 1 or 2, wherein said address ranges (AS1 to ASz) 
comprise overlapping regions of a predetermined size. 

4- A device according to any one the preceding claims, wherein said at least one 
access port (PI to PZ) provides access to a plurality of data sources for writing data to 
respective ones of said register memory sections, and to aplurality of data processing devices 
for reading data from said register memory sections. 

5. A device according to claim 4, wherein said access control means (A) is 

arranged to provide alternate access for said data sources and said data processing devices. 

6- A device according to claim 4 or 5, wherein data source accesses are 

controlled to cycle through said global address space, and processing device accesses are 
controlled to cycle through the address range of a respective register memory section 
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7. A device according to any one of the preceding claims, further comprising a 
buffer memory (B) connectable to said at least one access port (PI to PZ) and to said memory 
(EM), wherein a line width of said buffer memory (B) and said memory (EM) is selected to 
be greater or equal the data width of said at least one access port multiplied by the sum of 

5 read accesses and write accesses per cycle. 

8. A device according to claim 7, wherein said memory (EM) is a single-port 
memory. 

10 9, A device according to claim 7 or 8, wherein said at least one access port (PI to 

PZ) comprises a plurality of write ports and a plurality of read ports, wherein the number of 
write ports differs from the number of read ports. 

10. A device according to any one of claims 7 to 9, wherein said buffer memory 
15 (B) is arranged to buffer read and write accesses of said at least one access port (PI to PZ). 

11. A device according to any one of claims 7 to 1 0, wherein said address control 
means (A) comprises address translation means (AC) for aligning addresses relating to said 
read accesses in such a way that they fit to said line width. 

20 

12. A device according to claim 11, wherein said address translation means (AC) 
comprises a look-up table (LUT). 

13. A device according to any one of claims 7 to 12, wherein said access control 
25 means (A) is adapted to transfer write accesses to said buffer memory (B) until it is full, and 

to write one memory line when said buffer memory (B) is full. 

14. A device according to any one of claims 7 to 13, wherein said address control 
means (A) is adapted to align read accesses in such a way that a block of said line width is 

30 read all the time. 

15. A device according to any one of the preceding claims, wherein said at least 
two predetermined register memory sections are operated as FIFO memory sections. 
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16. A demultiplexing device for demultiplexing a plurality of input data streams 
and supplying demultiplexed data streams to a plurality of data processing units, said input 
data streams being supplied to a memory device as claimed in any one of the preceding 
claims. 

5 

17. A device according to claim 16, wherein said demultiplexing device comprises 
a PRML-based interleaver functionality. 

18. A multiplexing device for multiplexing data streams supplied from a plurality 
10 of data processing units, and for generating multiplexed output data streams, said date 

streams being supplied to a memory device as claimed in any one of claims 1 to 14. 

19. A device according to claim 1 8, wherein said multiplexing device comprises a 
PRML-based de- interleaver functionality. 

15 

20- A method comprising the steps of: 

providing a memory (EM) having at least two predetermined register memory 
sections addressable by respective address ranges (AS1 to ASz); 

providing at least one access port (PI to PZ) for providing access to said 
20 memory (EM); and 

providing access control means (A) for addressing said memory (EM) so as to 
operate said register memory sections as shift registers and to map shift register accesses of 
said at least one access port (PI to PZ) to predetermined addresses in the global address space 
of said memory (EM). 



PHNL030780EPP 



10 



16 23.06.2003 

ABSTRACT: 



The present invention relates to a memory device comprising a memory (EM) 
having at least two predetermined register memory sections addressable by respective 
address ranges ASl-ASz) and at least one access port (Pl-PZ) for providing access to said 
memory (EM). Furthermore, access control means (A) are provided for addressing said 
memory (EM) so as to operate said register memory sections as shift registers and to map 
shift register accesses of the at least one access port (PI to PZ) to predetermined addresses in 
the global address space of the memory (EM). In this way, it is possible to combine a 
plurality of FIFO memories in a single addressable memory device. This implementation is 
favourable in view of power consumption and area. Furthermore, by introducing a buffer 
memory, a multi-port memory device can be replaced by a single-port memory device of the 
same capacity. This advanced implementation also provides a reduced cycle and access time. 

Fig. 1 
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